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DATA REPLICATION AMONG STORAGE SYSTEMS 

CROSS-REFERENCES TO RELATED APPLICATIONS 
[0001] The present application claims priority to Japanese Patent Application No. 

2003 183734, filed on June 27, 2003. 

BACKGROUND OF THE INVENTION 
[0002] The present invention relates to a storage system, and more particularly to data 

replication among a plurality of storage systems. 

A technology relating to data replication among storage systems becomes important for 
providing uninterrupted service to customers even when failure occurs in a first storage 
system. The following patent specifications have disclosed a technology for replicating the 
information stored in a first storage system to a second and third storage system. 

[00031 US Patent No. 5,1 70,480 discloses a technology by which a first computer 

connected to a first storage system transfers data stored in a first storage system to a second 
computer via a communication link between a first computer and the second computer, and 
the second computer transfers the data to a second storage system connected to the second 
computer. 

[0004] US Patent No. 6,209,002 discloses a technology by which a first storage 

system transfers data stored in a first storage system to a second storage system and the 
second storage system transfers the data to a third storage system. A computer is connected 
to the first storage system by a communication link, the first storage system is connected to 
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the second storage system by a communication link, and the second storage system is 
connected to the third storage system by a communication link. The first storage system 
holds a first logical volume which is the replication object. The second storage system holds 
a second logical volume which is a duplicate of the first logical volume and a third logical 
volume which is a duplicate of the second logical volume. The third storage system holds a 
fourth logical volume which is a duplicate of the third logical volume. As described in the 
patent specification, the second storage system executes the process for data replication 
exclusively from the second logical volume to the third logical volume and the process for 
data replication from the third logical volume to the fourth logical volume. 
[0005] With the technology disclosed in US Patent No. 5,1 70,480, the first and 

second computers are used all the time. The first computer conducts the usual operations and 
the load of the data replication process on the first computer cannot be ignored. Another 
problem is extending the time for data reference and data update necessary for normal 
operation. Because the data for replication use a communication link between the first 
computer and the first storage system, this data transfer collides with the data transfer 
necessary for normal operation. 

[0006] With the technology disclosed in US Patent No. 6,209,002, a storage capacity 

twice as large as the quantity of the replicated data is required for the second and third 
storage systems. Furthermore, because a large quantity of the data is the object of replication, 
a long time is spent on the data replication process and the data of the third storage system 
becomes obsolete. As a result, a problem arises when the operation is restarted using the data 
of the third storage system: a long time is required to update the data of the third storage 
system and the time to restart the operation is extended. Furthermore, as described in this 
specification, after the first storage system has performed process of data updating in the first 
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storage system and process of data advancement between the first storage system and the 
second storage system, the first storage system sends a data update completion report to the 
host computer. Therefore, a long time is spent on updating data from the computer, and the 
time spent on updating data increases with the distance between the first storage system and 
the second storage system. As a result, another problem associated with the technology 
disclosed in the '002 patent is that the distance between the storage systems cannot be 
increased significantly. 

BRIEF SUMMARY OF THE INVENTION 
[0007] It is an object of the present invention to transfer or replicate data between a 

plurality of storage systems, without affecting the host computer of the storage systems and 
also without affecting communication between the storage systems and the host computer. 
[0008] Another object of the present invention is to enable the reduction of data 

storage areas provided in a plurality of storage systems. Yet another object is to transfer or 
replicate data among a plurality of storage systems effectively, at high speed and in a 
seamless manner, so that the operation of the host computer of the plurality of storage 
systems is not affected. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0009] FIG. 1 is a block diagram illustrating a logical configuration of the first 

embodiment of the present invention; 

[0010] FIG. 2 is a block diagram of the storage system of the first embodiment of the 

present invention; 
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[001 1 1 FIG. 3 illustrates the relationship between the update information and the write 

data of the first embodiment of the present invention; 

(0012] FIG. 4 illustrates an example of volume information of the first embodiment 

of the present invention; 

[0013] FIG. 5 illustrates an example of pair information of the first embodiment of 

the present invention; 

[0014] FIG. 6 illustrates an example of group information of the first embodiment of 

the present invention; 

[0015[ FIG. 7 illustrates an example of pointer information of the first embodiment of 

the present invention; 

10016] FIG. 8 illustrates a structure of the journal logical volume of the first 

embodiment of the present invention; 

[0017] FIG. 9 is a flow chart illustrating a procedure for initiating data replication of 

the first embodiment of the present invention; 

[0018] FIG. 1 0 is a flow chart illustrating an initial copying process of the first 

embodiment of the present invention; 

[0019] FIG. 1 1 illustrates a command reception process of the first embodiment of the 

present invention; 

[0020] FIG. 1 2 is a flow chart of the command reception process of the first 

embodiment of the present invention; 

[0021] FIG. 13 is a flow chart of a journal creation process of the first embodiment of 

the present invention; 

[0022] FIG. 14 illustrates a journal read reception process of the first embodiment of 

the present invention; 
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[0023 1 FIG. 1 5 is a flow chart of the journal read reception process of the first 

embodiment of the present invention; 

[0024] FIG. 16 illustrates a journal read command process of the first embodiment of 

the present invention; 

[0025] FIG. 1 7 is a flow chart of the journal read command process of the first 

embodiment of the present invention; 

[0026] FIG. 18 is a flow chart of a journal store process of the first embodiment of the 

present invention; 

[0027J FIG. 1 9 illustrates a restore process of the first embodiment of the present 

invention; 

|0028] FIG. 20 is a flow chart of the restore process of the first embodiment of the 

present invention; 

[0029] FIG. 21 illustrates an example of update information of the first embodiment 

of the present invention; 

[0030] FIG. 22 illustrates an example of update information during the journal 

creation process of the second embodiment of the present invention; and 
[0031] FIG. 24 illustrates the third embodiment of the present invention, 

DETAILED DESCRIPTION OF THE INVENTION 
[0032] The embodiments of the data processing system in accordance with the 

present invention are described below in greater detail with reference to the appended 
drawings. 

[0033] FIG. 1 is a block diagram illustrating a logical configuration of the first 

embodiment of the present invention. 
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[0034| In the configuration of the first embodiment of the present invention, a host 

computer 1 80 and a first storage system 1 00A are connected by a connection path 1 90, and 
the storage system 1 00A is connected to a second storage system 100B holding a duplicate of 
the data retained in the storage system 1 00A with a connection path 200. In the explanation 
below, in order to distinguish between the first storage system 100 holding the data which is 
the object of replication and the second storage system 100 holding the replicated data, we 
call the storage system 1 00 holding the data, which is the object of replication, the original 
storage system 100A, and the storage system 100 holding the replicated data the secondary 
storage system 1 00B. The storage area of the storage system is managed by partitioning and 
we call the partitioned storage areas logical volumes 230. As shown in FIG. 1 , the logical 
volumes include original l ogical volume 1 230-1 con taining DATA 1-S3Q-K original lo g ic al 
volume 2 230-2 containing DATA2^Q-3, journal logical volume 4 230-4 containing JNL1 
230 - 3 , and log ical v olume 3 230 -3 con taining DATA3^Q3-4 in the original storage system A 
100A; and secondary logical volume 1 230-5 containing COPY 1-2-3-0-5, secondary logical 
volume 2 230-6 contain ing COPY2^BO-6, secondar y journal volume 230-8 cont ainin g JN L2 
230 - 7 , and secondary logical volume 3 230-7 containing DATA1-2-30-8 in the secondary 
storage system B 100B. 

[0035] The capacity of the- each logical volume 230 and the physical storage location 

(physical address) in storage system 100 can be specified using host computer 1 80 or a 
maintenance terminal of a computer connected to storage system 1 00. The physical address 
of each logical volume 230 is retained in the volume information 400 described below with 
reference to FIG. 4 . The physical address is, for example, a number (storage device number) 
identifying a storage device 1 50 (see, FIG. 2) in storage system 100 and a numeric value 
uniquely indicating a storage area in the storage device, for example, a position from the head 
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of a storage area of the storage device. In the explanation below, the physical address is 
storage device number and a position from the head of a storage area of the storage device. 
The logical volume in the explanation below is a storage area of a single storage device, but a 
single logical volume can be also be associated with storage areas of a plurality of storage 
devices by converting the logical addresses and physical addresses. 
|0036l Reference and updating of the data retained by storage system 1 00 can be 

uniquely specified by a number identifying the logical volume and a numeric value uniquely 
indicating the storage area, for example, a position from the head of the storage area of the 
logical volume. A set of numbers comprising a logical volume number and a position from 
the head of the storage area of the logical volume (position in a logical address) are called a 
logical address. 

10037] Jn the explanation below, in order to distinguish between the data which is the 

object of replication and the replicated data, a logical volume 230, which is the object of 
replication, is called an original logical volume, and a logical volume 230, which is the 
replicated data, is called a secondary logical volume. The set of logical volumes comprising 
an original volume and a secondary volume is called a pair. The relationship and status of an 
original logical volume and a secondary logical volume is retained in the pair information 
500 described below with reference to FIG. 5 . 

10038] A management unit referred to as a group is defined. The management unit is 

configured to protect the update sequence of data between the logical volumes. For example, 
assume that host computer 180 will update data 1 of original logical volume 1 230- h then 
reading the data 1 and ttpthrttiii^ update data 2 of original logical volume 2 230-2 using a 
numeric value of data 1 . When replicating data from original logical volume 1 2 30-1 to 
secondary logical volume 1 230-5 (COPY ! ) and replicating data from original logical volume 
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2 230-2 to secondary logical volume 2 2 30- 6 (C O P Y 2) are carried out independently, the 
process of replicating data 2 to secondary logical volume 2 230-6 (COPY2) is sometimes 
conducted prior to the process of replicating data 1 to secondary logical volume 1 230-5 
(COPY! ) . When a process of replicating data 1 of volume 230-1 to secondary logical 
volume 1 230-5 (COPY ! ) is stopped, e.g., due to an accident, in the process of replicating 
data 2 of volume 230-2 to secondary logical volume 2 230-6 (COPY2) and replicating data 1 
to secondary logical volume 1 230-5 (COPY1 the consistency of the data of secondary 
logical volume 1 230-5 (COPY1 ) and secondary logical volume 2 230-6 (COPY2) is lost. In 
this case, too, in order to maintain the consistency of data of secondary logical volume 1 230- 
5 (COPY I ) and secondary logical volume 2 230-6 (COPY 2) , a logical volume 230-4 (JN LI) , 
which is required to protect the update sequence of data, is recorded in the same group and 
replicated in the secondary logical volume in the order of update numbers by allocating the 
update numbers of - group group information 600 (described below wi t h ref erence to F IG . 6 ) 
for each data update. For example, as shown in FIG. 1 , original logical volume 1 230-1 
(DATA1) and original logical volume 2 230-2 (DATA2) of original storage system 100A 
constitute group 1 . Secondary I-logical volume 1 230-5 (COPY1), which is the duplicate of 
original logical volume 1 230-K DATA1), and secondary logical volume 2 230-6 (COPY2), 
which is the duplicate of original logical volume 2 230-2 (DATA2), constitute group 1 in the 
secondary storage system 100B . 

|0039] When the data of the original logical volume, which is the object of data 

replication, is updated, the journal described below is created and retained in the logical 
volume 4 230-4 ( JNL 1 ) in original storage system 100A for updating the data of the 
secondary logical volume. In the explanation of the present embodiment, a logical volume 
(referred to here as a journal logical volume 230-4 ), retaining only a journal (JNL), is 
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allocated to each group. In the configuration shown in FIG. 1 , a logical volume 4 230-4 
(JNL1) is allocated to group 1 of storage system 100A . 

|0040] A journal logical volume 230-8 (JN L2) is also allocated to tt-groupJ_ of 

secondary storage system 100B. The journal logical volume 230-8 is used to retain a journal 
(JNL2) transferred from original storage system 100A to secondary storage system 100B. 
Retaining a journal in a journal logical volume makes it possible not to update data of the 
secondary logical volume during journal reception, for example, when the load on the 
secondary storage system 1 00B is high, and to wait to update the data of the secondary 
logical volume until the load on the secondary storage system 100B is low. Furthermore, in 
the case of a plurality of connection lines 200, multiple transfers of the journal from original 
storage system 1 00A to secondary storage system 1 00B can be made and the transfer capacity 
of the connection lines 200 can be used effectively. It is possible that many journals will 
remain in secondary storage system 100B due to the update sequence, but writing the journals 
that cannot be used directly for data update of the secondary logical volume in the journal 
logical volume makes it possible to free the cache memory. In the configuration shown in 
FIG. 1 , a logical volume 230-8 (JNL2) is allocated to group 1 in the secondary storage system 
I00B . 

[0041] A journal is composed of write data and update information. The update 

information is used to manage the write data and is composed of the time at which a write 
command was received, a group number, an update number of the group information 600 
described below, the logical address of the write command, the data size of the write data, the 
logical address of the journal logical volume storing the write data, and the like. The update 
information may hold only one of either the time at which a write command was received or 
the update number. When the creation time of the write data is present in the write command 
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from host computer 1 80, the creation time may be used instead of the time at which a write 
command was received. An example of the journal update information is described below 
with reference to FIG. 3 and FIG. 21 . A write command received at 22 hours, 20 minutes and 
10 seconds on March 17, 1999. is stored as update information 310. This write command 
stores the write data 320 in position 200701 from the head of the storage area of logical 
volume number 1 230-1 , the data size being 300. The write data 330 of the journal is stored 
at position 1 500 from the head of the storage area of logical volume number 4 230-4 (journal 
logical volume JNL1 ). The logical volume of logical volume number 1 230-1 belongs to 
group 1. As illust rated in FIG. 21. this This-journal indicates that is associated with the 
fourth data update since the data replication initiation of group 1 . 

[0042 J The journal logical volume 4 230-4 , for example, as shown in FIG. 3, is used 

by dividing the volume into a storage area containing update information and a storage area 
containing write data. In the update information area, the information is stored from the head 
of the update information area in the order of update numbers, and if the end of the update 
information area is reached, the information is stored from the head of the update information 
area. In the write data area, the write data is stored from the head of the write data area, and 
if the end of the write data area is reached, the data is stored from the head of the write data 
area. The ratio of the update information area and the write data area may be a fixed value or 
may be set from the maintenance terminal or host computer 180. The aforesaid information 
is held in the pointer information 700- 701 described below with reference to FIG. 7 . The 
journal logical volume, according to the explanation below, is used by dividing the volume 
into update information and write data areas, but a system for continuously storing the journal, 
that is, the update information and write data, from the head of the logical volume may be 
also used. 
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[0043] The operation for copying the data update of the primary or original logical 

volume stored in storage system 100A to the secondary logical volume of storage system 
100B is outlined below with reference to FIG. 1 . 

[0044] (1) If the original storage system 100A receives a write command from the 

host computer 180 relating to data in the original logical volume, the data update in the 
original logical volume 230- 1 (DATA1) and journal retention in the journal logical volume 
23 0-4 (JNLI) are controlled by the command reception process 210 and read-write process 
220 . as illustrated by the arrow 270 in FIG. 1 . 

[0045] (2) The secondary storage system 100B reads the journal from the original or 

primary storage system 100A by the journal read process 240 and retains the journal in the 
journal logical volume (JNL2 230-7) by the read- write process 220 in FIG. 1 . 
[0046] (3) If original storage system 1 00A receives a command for reading the 

journal from secondary storage system 100B, the journal is read from journal logical volume 
230-4 (JNLI) and transmitted to secondary storage system 100B by the command reception 
process 210 and read-Wwrite process 220 . as illustrated by the arrow 280 in FIG. 1 . 
[0047] (4) Secondary storage system 100B reads the journal from journal logical 

volume 230-8 (JNL2) in ascending order of update numbers using pointer information 701 
TOO and updates the date of secondary logical volume 1 230-5 (COPY1) by the restore 
process 250 read-write process 220. as illustrated by arrow 290 in FIG. 1 . 
[0048J The internal structure of storage system 1 00 is shown in FIG. 2. Storage 

system 100 comprises one or more host adapters 1 10, one or more disk adapters 120, one or 
more cache memory units 130, one or more shared memory units 140, one or more storage 
devices 1 50, one or more common paths 1 60, and one or more connection lines 1 70. Host 
adapters 1 10, disk adapters 120, cache memory units 130, and shared memory units 140 are 



11 



APPL. NO. 10/650,338 SUBSTITUTE SPECIFICATION - MARKED-UP COPY 

connected with each other by common paths 160. Common paths 1 60 may be duplexed to 
prevent a failure of a common path 160. Disk adapters 120 and storage devices 150 are 
connected by connection lines 1 70. Maintenance terminals configured to perform the setting, 
monitoring, and maintenance of storage system 100 are connected by special lines to all the 
host adapters 1 1 0 and disk adapters 120 (this connection is not shown in the figure). 
[00491 Host adapters 1 1 0 control data transfer between host computer 1 80 and cache 

memory 1 30. Host adapters 1 1 0 are connected to the host computer 1 80 or to the other 
storage systems 100 by the respective connection lines 190 and 200. Disk adapters 120 
control data transfers between cache memory 1 30 and storage devices 1 50. Cache memory 
1 30 is a memory for temporarily storing data received from host computer 1 80 or data read 
from storage device 1 50. Shared memory 140 is a memory shared by all the host adapters 
1 10 and disk adapters 120 in storage system 100. 

[0050] The volume information 400 manages the logical volume and holds a volume 

state, a format, a capacity, a pair number, and a physical address. FIG. 4 shows an example 
of volume information 400. Volume information 400 is retained in a memory referable from 
host adapter 1 10 and disk adapter 120, for example, in shared memory 140, for management 
purposes. Any of "normal", "original", "secondary", "abnormal", and "blank" is held as a 
volume state. The logical volume 230 with a "normal" or an "original" volume state 
represents the logical volume which is normally accessible from host computer 180. Access 
from host computer 180 may also be permitted to the logical volume with a "secondary" 
volume state. Logical volume 230 with an "original" volume state represents the logical 
volume for which replication of data is conducted. Logical volume 230 with a "secondary" 
volume state represents the logical volume that is used for replication. Logical volume 230 
with an "abnormal" volume state represents the logical volume which is normally 
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inaccessible due to failure. The failure is, for example, a breakdown of storage device 1 50 
holding logical volume 230. Logical volume 230 with a "blank" volume state represents the 
logical volume that has not been used. The pair number is effective when the volume state is 
"original" or "secondary" and holds a pair number for specifying pair information 500. In the 
example shown in FIG. 4, logical volume 1 shows that the format is OPEN3, the capacity is 3 
GB, and the data has been stored from the head of the storage area of storage device 1 50 of 
the storage device number 1, can be accessed and is the data replication object, 
[0051 ] Pair information 500 is the information for managing the pair and contains a 

pair state, an original storage system number, an original logical volume number, a secondary 
storage system number, a secondary logical volume number, a group number, and a copy 
complete address. FIG. 5 shows an example of pair information 500. Pair information 500 is 
retained in a memory referable from host adapter 1 10 and disk adapter 120, for example, a 
shared memory 140. Any of "normal", "abnormal", "blank", "not copying", and "copying" is 
retained as a pair state. The "normal" pair state shows that data replication of original logical 
volume 230 has been carried out normally. The "abnormal" pair state shows that replication 
of original logical volume 230 was not carried out due to a failure. The failure was, for 
example, disconnection of connection path 200. The "blank" pair state shows that the 
information of the pair number was not effective. The "copying" pair state shows that the 
initial copy process was undertaken. The "not copying" pair state shows that the initial copy 
process has not yet been conducted. The original storage system number contains the number 
specifying original storage system 100A holding original logical volume 230. The secondary 
storage system number contains the number specifying secondary storage system 100B 
holding secondary logical volume 230. In the case of the original storage system, a group 
number is retained to which the original logical volume belongs, and in the case of the 
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secondary storage system, the group number is retained to which the secondary logical 
volume belongs. The copy complete address is explained in the description of the initial copy 
process below. Pair information 1 in FIG. 5 shows that the data replication object is original 
logical volume 1 of original storage system 1 ; the data replication destination is secondary 
logical volume 1 of secondary storage system 2; and the data replication process was 
conducted normally. 

[0052] Group information 600 contains a group state, a pair set, a journal logical 

volume number, and an update number. FIG. 6 shows an example of group information 600. 
Group information 600 is retained in a memory referable from host adapter 1 10 and disk 
adapter 120, for example, a shared memory 140. Any of "normal", "abnormal", and "blank" 
is held as a group state. The "normal" group state shows that at least one pair state of a pair 
set is "normal". The "abnormal" group state shows that all the pair states of a pair set are 
"abnormal". The "blank" group state shows that the information of the aforesaid group 
number is not effective. In the case of the original storage system, the pair set holds the pair 
numbers of all the original logical volumes which belong to the group indicated by the group 
number. In the case of a secondary storage system, the pair numbers of all the secondary 
logical volumes belonging to the group indicated by the group number are held. The journal 
logical volume number shows the journal logical volume number belonging to the group with 
the aforesaid group number. For the update number, the initial value is 1, and 1 is added if 
data writing is performed with respect to the original logical volume in the group. The 
update number is stored in the journal update information and used for protecting the data 
update order in the secondary storage system 100B. For example, the group information 1 in 
FIG. 6 is composed of pair information 1, 2, original logical volumes 1, 2, and journal logical 
volume 4 and shows that data replication is being performed normally 
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[0053] Pointer information 700 is held for each group, to manage the journal logical 

volume of the group, and contains an update information area head address, a write data area 
head address, an update information newest address, an update information oldest address, a 
write data newest address, a write data oldest address, a read initiation address, and a retry 
initiation address. FIG. 7 and FIG. 8 show examples of pointer information 700 70 1 . The 
update information area head address holds the logical address of the head of the information 
area storing the update information of the journal logical volume. The write data area head 
address holds the logical address of the head of the information area storing the write data of 
the journal logical volume. The update information newest address holds the logical address 
of the head address of the area in which the next update information will be stored. The 
update information oldest address holds the logical address of the head retaining the update 
information of the oldest journal (the update number is small). The write data newest address 
holds the logical address of the head used for holding the write data when a journal is stored 
next. The write data oldest address holds the logical address of the head retaining the write 
data of the oldest journal (the update number is small). The read initiation address and retry 
initiation address are used only in the original storage system 100A for the journal read 
reception process. In the example of pointer information 700 70] in FIG. 7 and FIG. 8, the 
area retaining the journal management information is from the head of the information area of 
logical volume 4 to position 699, and the area retaining the write data of the journal is from 
position 700 to position 2699 in the information area of logical volume 4. The management 
information of the journal is retained from position of 200 to position of 499 in the 
information area of logical volume 4, and the management information of the next journal is 
retained from position 500 in the information area of logical volume 4. The write data of the 
journal is retained from position 1300 to position 2199 in the information area of logical 
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volume 4, and the write data of the next journal is retained from position 2200 in the 
information area of logical volume 4. 

[0054] The explanation below is provided with respect to a mode in which one 

journal logical volume is allocated to one group, but a plurality of journal logical volumes 
may also be allocated to one group. For example, two journal logical volumes are allocated 
to one group; the pointer information 700 701 is provided for each journal logical volume, and 
the journals are stored alternately. As a result, writing into storage device 1 50 of the journal 
can be dispersed and performance is expected to be improved. In addition, read performance 
of the journals is also improved. In another example, two journal logical volumes are 
allocated to one group and usually only one journal logical volume is used. The other journal 
logical volume is used when the performance of the journal logical volume that has been used 
is degraded. Performance degradation occurs, for example, when a journal logical volume is 
composed of a plurality of storage devices 150, data is held in a RAIDS system, and one of 
the storage devices 1 50 has failed. 

[0055] The above-described volume information 400, pair information 500, group 

information 600, and pointer information 700 701 are preferably stored in shared memory 140. 
However, the present embodiment is not limited to this case, and the information of those 
types may be also stored in a centralized or decentralized form in the cache memory 1 30, host 
adapter 1 10, disk adapter 120, and other storage devices 150. 

[0056] The procedure for initiating data replication from the original storage system 

100A to the secondary storage system 100B is described below with reference to FIG. 9 and 
FIG. 10. 

[0057] (1) Explanation of group creation (step 900). A user refers to group 

information 600 of original storage system 100A by using the maintenance terminal or host 
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computer 1 80 and acquires the group number A for which the group state is "blank". The 
user specifies the group number A by using the maintenance terminal or host computer 1 80 
and conducts the group creation specification in original storage system 100A. 
|0058] Having received the group creation specification, original storage system 

100A changes the group state with the specified group number A to "normal". 
|0059] Similarly, the user refers to group information 600 of secondary storage 

system 100B and acquires the group number B for which the group state is "blank". The user 
specifies secondary storage system 100B and the group number B and conducts the group 
creation specification in original storage system 100A using the maintenance terminal or host 
computer 1 80. Original storage system 100A transfers the received group creation 
specification to secondary storage system 100B. Secondary storage system 100B changes the 
group state with the specified group number B to "normal". 

|0060] The user may also specify the group number B and conduct the group creation 

specification in secondary storage system 1 00B by using the maintenance terminal of 
secondary storage system 100B or host computer 180 connected to secondary storage system 
100B. 

[0061] (2) Explanation of pair recording (step 910). The user specifies the 

information indicating the data replication object and the information indicating the data 
replication address and conducts the pair record specification in original storage system 100A 
by using the maintenance terminal or host computer 1 80. The information indicating the data 
replication object is the original logical value number and the group number A of the data 
replication object. The information indicating the data replication address is the secondary 
storage system 100B retaining the replicated data, the group number B, and the secondary 
logical volume number. 
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(0062] Having received the above-mentioned pair recording specification, original 

storage system 100A acquires the pair number for which the pair information is "blank" from 
pair information 500 and sets the pair state to "not copying", the original storage system 
number indicating original storage system 100A to the original storage system number, the 
specified original logical volume number to the original logical volume number, the specified 
secondary storage system number to the secondary storage system number, the specified 
secondary logical volume number to the secondary logical volume number, and the specified 
group number A to the group number. Original storage system 100A adds the acquired pair 
number to the pair set of group information 600 of the specified group number A and changes 
the volume state of the original logical volume number to "original". 

[0063] Original storage system 100A specifies the original storage system number 

indicating original storage system 100A, group number B specified by the user, the original 
logical volume number and the secondary logical volume number to secondary storage 
system 100B. Secondary storage system 100B acquires the blank pair number from pair 
information 500 and sets the pair state to "not copying", the original storage system number 
indicating original storage system 100A to the original storage system number, the specified 
original logical volume number to the original logical volume number, the secondary storage 
system number indicating the secondary storage system B to the secondary storage system 
number, the specified secondary logical volume number to the secondary logical volume 
number, and the specified group number B to the group number. 

[0064] Secondary storage system 100B adds the acquired pair number to the pair set 

of group information 600 of the specified group number B and changes the volume state of 
the secondary logical volume number to "secondary". 
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[0065] The above-described operation is performed for the pairs of all the data 

replication objects. 

[0066| The explanation above was provided with respect to the process in which the 

recording of logical volumes into a group and setting of logical volume pairs was performed 
simultaneously, but the volume pairs may be individually recorded. 

[0067J (3) Explanation of journal logical volume recording (step 920) is provided 

with reference to FIG. 9. The user provides the specification (journal logical volume 
recording specification) for recording the logical volume used for journal retention in original 
storage system 1 00 A using the maintenance terminal or host computer 1 80. The journal 
logical volume recording specification is composed of a group number and a logical volume 
number. 

[0068] Original storage system 1 00A records the specified logical volume number in 

the journal logical volume number of group information 600 with the specified group number. 
The volume state of volume information 400 of this logical volume is set to "normal". 
[0069] Similarly, the user refers to volume information 400 of secondary storage 

system 100B, specifies the logical volume numbers used as secondary storage system 100B, 
group number B, and the journal logical volume, and performs the journal logical volume 
recording in original storage system 100A by using the maintenance terminal or host 
computer 180. Original storage system 100A transfers the journal logical volume recording 
specification to secondary storage system 100B. Secondary storage system 100B records the 
specified logical volume number in the journal logical volume number of group information 
600 of the specified group number B. The volume state of volume information 400 of the 
aforesaid logical volume is set to "normal". 
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[0070] The user may also specify the logical volume number used as the journal 

logical volume and the group number and provide the journal logical volume recording 
specification in secondary storage system 100B using the maintenance terminal of secondary 
storage system 100B or the host computer 180 connected to secondary storage system 100B. 
|0071] The above-described operations are carried out with respect to all the logical 

volumes used as the journal logical volumes. The order of steps 910 and 920 can change. 
[0072] (4) Explanation of data replication process initiation (step 930) follows. The 

user specifies the group number for initiating the data replication process and specifies the 
initiation of the data replication process to original storage system 100A by using the 
maintenance terminal or host computer 1 80. Original storage system 100A sets to 0 all the 
copy complete addresses of pair information 400 belonging to the specified group. 



[0073] Original storage system 100A specifies the initiation of the journal read 

process and the restore process to the secondary storage system 100B. 

[0074] Original storage system 100A initiates the initial copying process. 

[0075] (5) Explanation of initial copying process end (step 940). 

[0076] If the initial copying process ends, original storage system 1 00A posts the end 



of the initial copying process to secondary storage system 1 00B. Secondary storage system 
100B changes the pair states of all the secondary logical volumes belonging to the specified 
group to "normal". 

[0077] FIG. 10 is a flowchart of the initial copying process. In the initial copying 

process, the copy complete address of pair information 500 is used with respect to the entire 
storage area of the original logical volume of the data replication object and a journal is 
created for each unit size in the order starting from the head of the storage area. The initial 
value of the copy complete address is 0, and the created data quantity is added up for each 
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journal creation. A journal creation is completed in the initial copying process from the head 
of the storage area of the logical volume to the address which is one prior to the copy 
complete address. Conducting the initial copying process makes it possible to transfer the 
data of the original logical volume that has not been updated into the secondary logical 
volume. In the explanation below, processing conducted by one of host adapters A-j 1 0 in 
original storage system 100A is described, but processing may also be conducted by one of 
disk adapters 120. 

[0078] (1) Host adapter A -j 10 of original storage system 100A obtains original 

logical volume A- 230-1 for which the pair state is "not copying" in the pair belonging to the 
group that is the object of processing, changes the pair state to "copying", and repeats the 
process (steps 1010, 1020). When no original logical volume A- 230-1 is present, the process 
ends (step 1030). 

[00791 (2) In step 1020, when logical volume A- 230-1 is present, the host adapter A 

1 1 0 creates a journal with the unit size (for example, 1 MB) data. The journal creation 
process is described below (step 1040). 

[0080] (3) The host adapter A- l 1 0 adds the data size of the created journal to the copy 

complete address (step 1050). 

[0081] (4) The above-described processing is repeated until the copy complete 

address reaches the capacity of original logical volume A- 230-1 (step 1060). When the copy 
complete address becomes equal to the capacity of original logical volume- A 230- K the 
journal is created with respect to the entire storage area of original logical volume-A 230-1 . 
Thus, the pair state is updated to "normal" and processing of the other original logical 
volumes is initiated (step 1070). 
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|0082| In the above-described flow chart, the explanation was conducted with respect 

to processing logical volumes one by one. However, a plurality of logical volumes may be 
processed simultaneously. 

[0083] FIG. 1 1 illustrates the command reception process 210; FIG. 12 is a the flow 

chart of the command reception process 210, and FIG. 13 is a flow chart of the journal 
creation process. The operation relating to a case in which original storage system 100A 
receives a write command for logical volume 230 of the data replication object from host 
computer 1 80 is explained below with reference to these figures. 

[0084] (1 ) The host adapter A-.1 10 in original storage system 1 00A receives an access 

command from the host computer. The access command includes commands such as read, 
write, and journal read, the logical address which is the object of the command, the quantity 
of data, and the like. In this example, the logical address in the access command is denoted 
by the logical address "A"; the logical volume number is denoted by logical volume number 
of volume 230-1 -A; the location in the logical volume is denoted by allocation "A" in the 
logical volume 230-1 ; and the quantity of data is denoted by data quantity "A" o f the vol ume 
230-1 (step 1 200 of FIG. 12). 

[0085] (2) Host adapter A- l 1 0 examines the access command (steps 1210, 121 5 of 

FIG. 12). When the access command is a journal read command in step 1215 of FIG. 12, the 
journal read reception process is implemented (step 1220 of FIG. 12). When the access 
command is other than the journal read command and write command, for example, when the 
access command is a read command, a read process is implemented in accordance with 
conventional technology (step 1230 of FIG. 12). 

[0086] (3) When the access command is a write command in step 1210 of FIG. 12, 

volume information 400 of logical volume A- 230-1 is referred to and the volume state is 
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examined (step 1240 of FIG. 12). When the volume state of logical volume A-230-1 is other 

than "normal" or "original" in step 1240, access to logical volume A- 2 30- 1 is impossible. As 
a result, an abnormal end is reported to host computer 180 (step 1245 in FIG. 12). 
|0087] (4) When the volume state of logical volume A - 2 3 0-1 is either "normal" or 

"original" in step 1240, host adapter A- 1J (preserves cache memory 130 and notifies host 
computer 1 80 that the preparation for data reception has been made. Host computer 1 80 
receives this notification and transmits the write data to original storage system 1 00A. Host 
adapter A~ j 10 receives the write data and retains it in cache memory 1 30 (steps 1 250, 1 1 00 in 
FIG. 11). 

[0088J (5) Host adapter A- l 10 refers to the volume state of logical volume A- 2 3 0-1 

and determines whether logical volume A-230- 1 is the data replication object (step 1260 of 
FIG. 12). When the volume state is "original" in step 1260, logical volume A- 230-1 is the 
data replication object. Accordingly, the journal creation process is conducted (step 1265 of 
FIG. 12). 

[0089] (6) When the volume state is "normal" in step 1 240, or after completing the 

journal creation process of step 1265, host adapter A- 1 1 0 instructs disk adapter 120 to write 
the write data into storage device 150 (1 140 in FIG. 1 1) and reports the process end to host 
computer 180 (steps 1270, 1280 of FIG. 12). Thus disk adapter 120 retains the write data in 
storage device 150 by read-write process (1 1 10 in FIG. 1 1). 
[0090] The journal creation process is explained below. 

[0091] (1) Host adapter A-l 10 examines the volume state of the journal logical 

volume 230-4 (step 1 3 1 0 in FIG. 1 3). When the volume state of the journal logical volume 
230-4 is "abnormal" in step 1310, storing a journal in the journal logical volume 2 30-4 is 
impossible. Accordingly, the group state is changed to "abnormal" and the process ends (step 
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1315 in FIG. 1 3). In this case, the journal logical volume 230-4 is changed to the normal 
logical volume. 

[0092] (2) When the journal logical volume 230-4 is determined to be "normal" in 

step 1310, the journal creation process continues. The journal creation process differs 
depending on whether it is conducted within the initial copying process or within the 
command reception process (step 1320 in FIG. 13). When the journal creation process is 
executed within the command reception process, the process is conducted from step 1330 in 
FIG. 13. When the journal creation process is executed within the initial copying process, the 
process is conducted from step 1370 in FIG. 13. 

[0093] (3) When the journal creation process is conducted within the command 

reception process, host adapter A-l 10 determines whether the logical address "A" of the write 
object has become the processing object of the initial copying process (step 1330). When the 
pair state of logical volume A-230-1. is "not copying", the journal creation process is executed 
thereafter in the initial copying process. As a result, the process ends without creating a 
journal (step 1335 in FIG. 13). When the pair state of logical volume A- 230-1 is "copying," 
if the copy complete address is equal to or less than position "A^ in the logical address, the 
journal creation process is carried out thereafter in the initial copying process. Accordingly, 
the process ends without creating a journal (step 1 335). In other cases, that is, when the pair 
state of logical volume A-230-1 is "copying" and the copy complete address is no less than 
position ^A" in the logical address, or when the pair state of logical volume A- 230-1 is 
"normal," the initial copying process has already been completed. Therefore, the journal 
creation process continues. 

[0094] (4) Host adapter A-l. 1 0 then determines whether the journal can be stored in 

the journal logical volume. The presence or absence of a blank area in the update 
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information area is ascertained using pointer information 700701 (step 1 340 in FIG. 1 3). 
When the update information newest address and the update information oldest address of 
pointer information 700701 are equal to each other, no blank area exists in the update 
information area. For this reason, the process ends as a journal creation failure (step 1390 in 
FIG. 13). 

[0095j When a blank area is present in the update information area in step 1 340, 

pointer information 700 70 1 is used to determine whether the write data can be stored in the 
write data area (step 1 345 in FIG. 1 3). When the sum of the write data newest address and a 
quantity "A" of the data is equal to or larger than the write data oldest address, storage is not 
possible in the write data area. For this reason, the process ends as a journal creation failure 
(step 1390). 

[0096] (5) When the journal can be stored, host adapter A- 1 10 acquires the logical 

address storing the update number and update information and the logical address storing the 
write data, and creates the update information in cache memory 130. The update number, a 
numeric value acquired from group information 600 of the object group and having 1 added 
thereto, is set to the update number of group information 600. The logical address storing the 
update information is the update information newest address of pointer information 700701 , 
and the numeric value having the size of the update information added thereto is set to the 
update information newest address of pointer information 700 701 . The logical address 
storing the write data is the write data newest address of pointer information 700701 , and the 
numeric value obtained by adding the quantity ^A" of the data to the write data newest 
address is set to the write data newest address of pointer information 700 701 . 
[0097] Host adapter A-j 10 sets the above-described numeric values, group number, 

the time at which the write command was received, the logical address "A" in the write 
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command, and quantity "A" of the data, to the update information (step 1350 in FIG. 13; 
1 120 in FIG. 1 1). For example, when a write command of data with a size of 100 has been 
received in position 800 from the head of the storage area of the original logical volume 1 
230-1 of group 1 in a state of group information 600 shown in FIG. 6 and pointer information 
799 701 shown in FIG. 7, the update information shown in FIG. 22 is created. The update 
number of the group information becomes 5, the update information newest address of the 
pointer information becomes 600 (the size of the update information is 100), and the write 
data newest address becomes 2300. 

[0098] (6) Host adapter A- l 10 instructs disk adapter 1 20 to write the update 

information and the write data into storage device 1 50 and normally ends processing (step 
1360 in FIG 13; 1130, 1140, 1150 in FIG. 11). 

[0099] (7) When the journal creation process is within the initial copying process, the 

process of step 1370 in FIG. 13 is conducted. Host adapter A-110 determines whether the 
journal can be created. The presence or absence of a blank area of update information is 
examined using pointer information 790701 (step 1370). When the update information 
newest address and the update information oldest address of pointer information 70 970 1 are 
equal to each other, no blank area is present in the update information area. For this reason, 
the process ends as a journal creation failure (step 1390). In the case of the initial copying 
process shown in the present embodiment, the write data of the journal is read from the 
original logical volume, and because the write data area is not used, confirms the blank area 
of the write data area is not necessary. 

[0100] (8) When creation of the journal is ascertained as possible at step 1370, host 

adapter A-110 acquires the numeric value that will be set in the update information and 
creates the update information in cache memory 1 30. The update number, a numeric value 
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acquired from the group information 600 of the object group and having 1 added thereto is set 
to the update number of the group information 600. The logical address storing the update 
information is the position of the update information newest address of pointer information 
-700701 , and the numeric value having the size of the update information added thereto is set 
to the update information newest address of pointer information r 7OO70j . 
[0101] Host adapter A- T 1 0 sets the above-described acquired numeric values, the 

group number, the initiation time for this process, the logical address of the initial copying 
process object, a processing quantity of one initial copy, and the logical address of the initial 
copying process object to the logical address of the journal logical volume storing the write 
data (step 1380 in FIG. 13; 1120 in FIG. 11). 

[0102] (9) Host adapter A -l 10 instructs disk adapter 120 to write the update 

information into storage device 150 and normally ends the process (step 1385 in FIG. 13; 
1140, 11 60 in FIG. 11). 

[01031 I n the explanation above, a case was described in which the update 

information was present in cache memory 130, but the update information also may be stored 
in shared memory 140. 

[0104] Writing the write data into storage device 1 50 may be conducted 

asynchronously, that is, not directly after steps 1360 and 1385. However, if host computer 
1 80 again issues a write command to logical address ^A^, the write data of the journal is 
overwritten. Therefore, the write data of the journal has to be written into a storage device 
150 corresponding to the logical address of the journal logical volume of update information 
before write data from host computer 1 80 is received. Alternatively, writing into a storage 
device 1 50 corresponding to the logical address of the journal logical volume of update 
information may be conducted after the write data is moved in another cache memory. 
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[0105] In the above-described journal creation process, the journal was assumed to be 

retained in storage device 150, but it is also possible to prepare a constant-capacity cache 
memory 130 for the journal in advance and to retain the journal in storage device 150 after 
this cache memory has been entirely used. The cache memory capacity for the journal is 
specified, for example, from the maintenance terminal. 

|0106] Read-write process 220 is implemented by disk adapter 120 upon reception of 

a command from host adapter 1 10 or disk adapter 120. The process to be implemented 
includes a process of writing the data of the specified cache memory 1 30 into a storage area 
in storage device 1 50 corresponding to the specified logical address and a process of writing 
the data into the specified cache memory 1 30 from the storage area in storage device 1 50 
corresponding to the specified logical address. 

[0107| FIG. 14 explains the operation (journal read reception process) of host adapter 

A- 1 1 0 of original storage system 100A that has received a journal read command. FIG. 15 is 
a flow chart of this operation. An operation relating to a case in which original storage 
system 100A has received a journal read command from secondary storage system 100B is 
explained below with reference to these figures. 

[0108] (1) Host adapter A-j 10 in original storage system 100A receives an access 

command from secondary storage system 100B. The access command comprises an 
identifier indicating that this is a journal read command, a group number of the command 
object, and the presence or absence of retry specification.^ The group number in the access 
command isdenoted below as group number "A" (step 1220 in FIG. 15; 1410 in FIG. 14). 
[0109| (2) Host adapter A~j 10 determines whether the group state with group number 

"A" is "normal" (step 1510). When the group state is found to be other than "normal", for 
example, a "failure" in the examination conducted in step 1510, the group state is posted to 
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secondary storage system 1 00B and the process ends. Secondary storage system 1 00B 
conducts the process according to the received group state. For example, when the group 
state is "failure," the journal read process ends (step 1515). 

[0110] (3) When the group state of group number "A" is "normal" at step 1510, host 

adapter A- 1 10 examines the volume state of the journal logical volume 2 30 - 4 (step 1 520). 
When the volume state of the journal logical volume is found to be, for example, "failure", 
rather than "normal", in the examination conducted at step 1520, the group state is changed to 
"failure;" the group state is posted to secondary storage system 100B, and the process ends. 
Secondary storage system 1 00B conducts processing according to the received group state. 
For example, when the group state is "failure," the journal read process ends (step 1525), 
[0111] (4) When the volume state of the journal logical volume is "normal" at step 

1520, a determination is made whether the journal read command is a retry specification (step 
1530). 

[0112] (5) When the journal read command is found to be a retry at step 1530, host 

adapter A- 1 10 again transmits the previously transmitted journal to secondary storage system 
100B. Host adapter A- 110 reserves cache memory 130 and instructs the disk adapter to read 
the information on the size of the update information from the retry initiation address of 
pointer information 7-OQ70 1 to the cache memory (1420 in FIG. 14). 

[0113] In the read- write process of the disk adapter, the update information is read 

from storage device 150 and retained in cache memory 130, and host adapter A- 1 1 0 is 
notified to this effect (1430 in FIG. 14). 

[0114] Host adapter A- 1 10 receives the notification of the read end of update 

information, acquires the logical address of write data and the size of the write data from the 
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update information, reserves cache memory 1 30, and instructs the disk adapter to read the 
write data in the cache memory (step 1540 in FIG. 15; 1440 in FIG. 14). 
[01 15] I n the read-write process of the disk adapter, the write data is read from 

storage device 1 50 and retained in cache memory 130, and host adapter A- 1 1 0 is notified to 
this effect (1450 in FIG. 14). 

[0116] Host adapter A- j 1 0 receives notification of the read end of write data, 

transmits the update information and write data to e-secondary storage system 100B, frees 
cache memory 130 holding the journal, and ends the process (step 1545 in FIG. 15; 1460 in 
FIG. 14). 

[01 17] (6) When the journal read command is found not to be a retry specification in 

the examination conducted at step 1530, host adapter A-j 10 examines the presence of a 
journal that has not been transmitted. If such a journal is present, it is transmitted to 
secondary storage system 1 00B. Host adapter A- 1 1 0 compares the read initiation address of 
pointer information 700 701 with the update information newest address (step 1550). 
[0118] When the read initiation address is equal to the update information newest 

address, the transmission of the entire journal to secondary storage system 100B is complete. 
Therefore, "no journal" is transmitted to secondary storage system 100B (step 1560) and the 
storage area of the journal that was transmitted to the secondary storage system 100B during 
the previous journal read command is freed (step 1590). 

[0119] The process for freeing the journal storage area sets the retry initiation address 

to the update information oldest address of pointer the-information -74)0 701 . When the update 
information oldest address becomes the write data area head address, the update information 
oldest address is set at "0." The write data oldest address of pointer information 700701 
changes to a numeric value obtained by adding the size of the write data transmitted 
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according to the previous read journal command. When the write data oldest address 
becomes the logical address above the capacity of the journal logical volume 230-4 , the write 
data area head address is decreased and corrected. 

[0120] (7) When the presence of a non-transmitted journal is found in the 

examination conducted in step 1550, host adapter A-j 1 0 reserves cache memory 130 and 
instructs the disk adapter to read the information on the size of update information from the 
read initiation address of pointer information : ?O9701 to the cache memory (1420 in FIG. 14). 
|0121 j In the read-write process of disk adapter-A 1 1 0 , the update information is read 

from storage device 150 and retained in cache memory 130, and the host adapter is notified to 
this effect (1430 in FIG. 14). 

|0122] The host adapter A-l 10 receives the notification of the read end of update 

information, acquires the logical address of the write data and the size of the write data from 
the update information, reserves the cache memory 130, and instructs the disk adapter A- l 10 
to read the write data in the cache memory (step 1 570; 1440 in FIG. 14). 
[0123] In the read-write process of disk adapter-A- 1 1 0 , the write data is read from the 

storage device 1 50 and retained in the cache memory 1 30, and the host adapter is notified to 
this effect (1450 in FIG. 14). 

[0124] Host adapter A- 110 receives notification of the read end of the write data, 

transmits the update information and the write data to the secondary storage system 100B 
(step 1580) and frees cache memory 130 holding the journal (1460 in FIG. 14). Then, it sets 
the read initiation address to the retry initiation address of pointer information 700701 and 
sets a numerical value obtained by adding the update information size of the transmitted 
journal to the read initiation address. 
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[0125] (8) Host adapter A-j 10 frees the storage area of the journal that was 

transmitted to secondary storage system 100B during the previous journal read command 
(step 1590). 

|0126] In the above-described journal read reception process, original storage system 

1 00A transmitted journals one by one to secondary storage system 1 00B. However, a 
plurality of journals may simultaneously be transmitted to 1 secondary storage system 100B. 
The number of journals transmitted in response to one journal read command may be 
specified in the journal read command by secondary storage system 100B and may be 
specified in original storage system 1 00A or secondary storage system 1 00B by the user, e.g., 
during the group creation. Furthermore, the number of journals transmitted in response to 
one journal read command may be changed dynamically according to the transfer capacity, 
load, and the like, of connection path 200 of original storage system 1 00A and secondary 
storage system 100B. 

Further, the journal transfer quantity may also be specified by considering the size of the 
write data of journals, rather than the number of journals. 

(0127] In the above-described journal read reception process, the journals were 

written into cache memory 130 from storage device 150, but if the journals are present in 
cache memory 130, this process is unnecessary. 

(0128] The above-described process for freeing the storage area of the journal in the 

journal read reception process was conducted during subsequent journal read command 
processing, but the freeing of storage area may be also conducted immediately after the 
journal has been transmitted to secondary storage system 100B. Further, secondary storage 
system 100B may set the update number whose storage area can be freed in the journal read 
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command, and original storage system 100A may free the journal storage area according to 
this specification. 

[0129] FIG. 16 explains journal read command process 240. FIG. 17 is a flow chart 

of this process. FIG. 1 8 is a flow chart of a journal store process. The operation in which 
host adapter B-j 10 of secondary storage system 100B reads the journals from original storage 
system 100A and stores them in a journal logical volume 230-8 is described below with 
reference to these figures. 

[0130] (1) Host adapter B- 1 1 0 in secondary storage system 100B reserves cache 

memory 130 storing the journal and transmits to original storage system 100A an access 
command comprising an identifier indicating that this is a journal read command, a group 
number of the command object in original storage system 100A, and the presence or absence 
of a retry specification. The group number in the access command is denoted below as a 
group number IvV; (step 1 700; 1 61 0 in FIG. 1 6). 

[0131 ] (2) Host adapter B-j 10 receives the journal and the response of original 

storage system 100A (1620 in FIG. 16). Host adapter B-1 10 examines the response and, 
when the response from original storage system 100A is "no journal," transmits a read journal 
command to original storage system 100A after a certain interval, because the journal of the 
specified group is not present in original storage system 100A (steps 1720, 1725). 
[0132] (4) When the response of original storage system 100A is "group state is 

failure" or "group state is blank," the group state of secondary storage system 100B is 
changed to the received state, and the journal read process ends (steps 1730, 1735). 
[0133] (5) When the response of original storage system 100A is normal, that is, other 

than the above-described responses, the volume state of journal logical volume 230 -8 is 
examined (step 1740). When the volume state of journal logical volume 230-8 is "abnormal", 
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the journal cannot be stored in the journal logical volume 230-8. Therefore, the group state is 
changed to "abnormal" and the process ends (step 1 745). In this case, the group state is 
returned to normal by changing the journal logical volume 230-8 to the normal logical 
volume. 

|0134] (6) When the volume state of the journal logical volume 230-8 is normal in the 

examination conducted at step 1740, journal store process 1800 is conducted. When the 
journal store process 1 800 ends normally, the next journal read command is transmitted. 
Alternatively the next journal read command is transmitted after a certain interval (step 1760 
in FIG. 1 7). As for the timing for transmitting the next journal command, transmission may 
be conducted periodically at constant intervals or the timing may be determined by the 
number of journals, the traffic on connection line 200, the storage capacity of the journal held 
by secondary storage system 100B, the load on secondary storage system 100B, and the like. 
Furthermore, the information capacity of the journal held by original storage system 100A or 
the pointer information of original storage system 100A may be read from secondary storage 
system 1 00B and the timing may be determined by the numeric value thereof. Transfer of 
this information may be conducted by a special command and may be included in the 
response of the journal read command. In considering the timing for transmitting the next 
journal command and changing a term for transmitting the next journal command, if the 
information capacity of the journal held by the original storage system 1 00A or the 
information capacity of the journal held by the secondary storage system 100B has greater 
quantity, the data processing system can change the information capacity of the journal for 
the storage system 100A or 100B. The subsequent process is identical to that following step 
1710. 
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[0135] (7) When the journal storage process of step 1800 does not end normally, the 

blank area of the journal logical volume 230-8 is not presented. For this reason, the received 
journal is canceled and a journal read command with a retry specification is transmitted after 
a certain interval (step 1 755 in Fig. 1 7). Alternatively, the journal is held in the cache 
memory and the journal store process is conducted again after a certain interval. This is 
because conducting restore process 250 (described below) makes it possible to add the blank 
area to the journal logical volume after a certain interval. With such a method, the presence 
of a retry specification in the journal read command is unnecessary. 
[0136] Journal store process 1800 illustrated in FIG. 18 is explained below. 

|0137] (1) Host adapter B-1J0 determines whether the journal can be stored in the 

journal logical volume 230 -8. The presence of a blank area in the update information area is 
examined using pointer information -700 7 0 1 (step 1810). When the update information 
newest address and the update information oldest address of pointer information 700701 are 
equal to each other, no blank area is present in the update information area. For this reason, 
the process ends as a journal creation failure (step 1 820). 

[0138] (2) When a blank area is present in the update information area in step 1810, 

an examination is conducted about whether the write data can be stored in the write data area 
by use of pointer information 700701 (step 1 830). When the sum of the write data newest 
address and the quantity of write data of the received journal is equal to or larger than the 
write data oldest address, the write data cannot be stored in the write data area. For this 
reason, the process ends as a journal creation failure (step 1820). 

[0139] (3) When the journal can be stored, host adapter B~ l 10 changes the group 

number and the logical address of the journal logical volume 230-8 in the received update 
information. The group number is changed to the group number "B" of secondary storage 
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system 100B, and the logical address of the journal logical volume is changed to the write 
data newest address of pointer information -706 701 . Host adapter B- JJO changes the update 
information newest address of pointer information 79 0701 to a numeric value obtained by 
adding the size of update information to the update information newest address. Host adapter 
B- l 1 0 changes the write data newest address of pointer information 700 7 01 to a numeric 
value obtained by adding the size of write data to the write data newest address (step 1 840). 
[0140] (4) Host adapter B-l 10 instructs disk adapter 120 to write the update 

information and the write data in storage device 150 and ends the process as a journal 
creation success (step 1 850 in FIG. 18; 1630. in FIG. 16). Disk adapter 120 then writes the 
update information and the write data into storage device 150 by the read-write process and 
frees cache memory 130 (1640 in FIG. 16). 

[0141 J In the above-described journal store process, the journal was assumed to be 

retained in storage device 150, but it is also possible to prepare a constant-capacity cache 
memory 130 for the journal in advance and to retain the journal in storage device 150 after 
the entire cache memory has been used. The cache memory capacity for the journal is 
specified, for example, from the maintenance terminal. 

[0142] FIG. 19 explains restore process 250. FIG. 20 is a flow chart of this process. 

The operation in which host adapter B- 1 1 0 of secondary storage system 100B uses the journal 
and conducts data update is described below with reference to these figures. Restore process 
250 may be also conducted by e-disk adapter 120 of secondary storage system 100B. 
[0143] (1) Host adapter B-l 00 determines whether the group state of group number 

"B" is "normal" (step 201 0). When the group state is found to be other than "normal", for 
example, "failure," at step 2010, the restore process ends (step 2015). 
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[0144] (2) When the group state is found to be "normal" at step 2010, the volume 

state of the journal logical volume 230-8 is examined (step 2020). When the volume state of 
journal logical volume 230-8 is found to be "abnormal" at step 2020, access is impossible. 
As a result, the group state is changed to "abnormal" and the process ends (step 2025). 
[0145] (3) When the volume state of journal logical volume 230-8 is found to be 

"normal" at step 2020, examination is made to of whether the journal of restore object is 
present. Host adapter B~ 1 1 0 acquires the update information oldest address and the update 
information newest address of pointer information 700 701 . When the update information 
oldest address and the update information newest address are equal to each other, no journal 
is present. Accordingly, the restore process is temporarily stopped and resumed after a 
certain interval (step 2030). 

[0146] (4) When the journal of restore object is present m-t+ie-at step 2030, the 

following process is conducted with respect to the journal having the oldest (minimum) 
update number. The update information of that journal is retained from the update 
information oldest address of pointer information 700701. Host adapter B- l 10 reserves cache 
memory 130 and instructs the disk adapter to read the information on the size of the update 
information in cache memory 1 30 ( 1 9 1 0 in FIG. 1 9). 

[0147] In the read-write process of the disk adapter, the update information is read 

from storage device 1 50 and retained in cache memory 1 30, and host adapter B- l 10 is 
notified to this effect (1920 in FIG. 19). 

[0148] Host adapter £H 10 receives the notification of the read end of the update 

information, acquires the logical address of the write data and the size of the write data from 
the update information, reserves cache memory 1 30, and instructs the disk adapter to read the 
write data in cache memory (1930 in FIG. 19). 
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[0149] In the read-write process of the disk adapter, the write data is read from 

storage device 1 50 and retained in cache memory 1 30, and the host adapter is notified to this 
effect (step 2040 of FIG. 20; 1940 in FIG. 19). 

[0150] (5) Host adapter B-j 10 finds the logical address of the secondary logical 

volume that will be obtained from the update information and instructs the disk adapter to 
write the write data into the secondary logical volume 230-5 (step 2050 in FIG. 20; 1 950 in 
FIG. 19). In the read-write process of the disk adapter, the data is written into storage device 
1 50 corresponding to the logical address of the secondary logical volume 230-5 ; cache 
memory 130 is freed, and the host adapter is notified to this effect (1960 in FIG. 19). 
[0151 ] (6) Host adapter B- T10 receives the notification of the write process end of the 

disk adapter and frees the journal storage area. The process for freeing the journal storage 
area changes the update information oldest address of the pointer information 700 701 to a 
numeric value obtained by adding the size of update information. When the update 
information oldest address becomes the write data area head address, the update information 
oldest address is set "0". The write data oldest address of pointer information 700701 
changes to a numeric value obtained by adding the size of the write data. When the write 
data oldest address becomes the logical address above the capacity of the journal logical 
volume, the write data area head address is decreased and corrected. Host adapter B- l 10 
thereafter initiates the next restore process (step 2060 in FIG. 20). 

[0152] In the above-described restore process 250, the journal was read from storage 

device 1 50 into cache memory 1 30, but when it is present in cache memory 1 30, this process 
is unnecessary. 

[0153] In the above-described journal read reception process and journal read 

command process 240, a journal to be transmitted by original storage system 100A was 
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determined by pointer information ?OQ70 1 , but secondary storage system 100B may also be 
determined a journal to be transmitted. For example, an update number is added to the 
journal read command. In this case, a retrieval method or a table for finding the logical 
address where the update information has been stored from the update number is provided in 
shared memory 140 of original storage system 100A in order to find the logical address of 
update information with the update number specified by secondary storage system 100B in 
the journal read reception process. 

[0154) In the above-described journal read reception process and journal read 

command process 240, a journal read command was used, but the usual read command may 
be also used. For example, the group information 600 and pointer information 700701 of 
original storage system 100A are transferred in advance into secondary storage system 100B, 
and secondary storage system 100B reads the data (that is, the journal) of the journal logical 
volume 230-4 of original storage system 100A. 

[0155] In the above-described journal read reception process, the explanation was 

conducted with respect to the case in which the journals were transmitted from original 
storage system 100A to -Ssecondary storage system 100B in the order of update numbers. 
However, they may be also transmitted not in the order of update numbers. Furthermore, a 
plurality of journal read commands may be transmitted from original storage system 100A to 
secondary storage system 100B. In this case, a retrieval method or a table for finding the 
logical address where the update information has been stored from the update number is 
provided in secondary storage system 100B in order to process the journals in the order of 
update numbers in the restore process. 

[0156| In the above-described data processing system in accordance with the present 

invention, the original storage system acquires the journals and the secondary storage system 
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replicates the data. As a result, the host computer connected to the original storage system 
carries no load relating to data replication. Furthermore, communication lines of the original 
storage system and the host computer connected to the original storage system are not used 
because the journals are transferred between the original storage system and the secondary 
storage system. 

[0157] FIG. 23 shows a logical configuration of the second embodiment. 

[0158] In this configuration, host computer 1 80 and storage system 1 00C are 

connected by connection path 190; storage system 100C and original storage system 100A 
are connected by connection path 200, and original storage system 100A and secondary 
storage system 1 00B are connected by connection path 200. Storage system 1 00C includes 
logical v olumes 230-9 (O R G1 ) and 2 30-10 ( ORG2), and conducts the data update of ft-logical 
volume 230-9 (ORG1) and the data update of a-logical volume 230-1 (DATA1) in original 
storage system 100A during the data update into the logical volume 230-9 (ORG!) of storage 
system 100C. 

[0159] Original storage system 1 00A, as was described in the first embodiment, 

conducts the retention of journals in a-journal logical volume 230-4 (JNL1) using command 
reception process 210 and read-write process 220 during the data update into the original 
logical volume 230-1 (DATA1) (2310 in FIG. 23). 

[0160] Secondary storage system 100B reads the journal from original storage system 

100A using journal read process 240 described above, and retains the journal in the journal 
logical volume 2 30-8 (JNL2) using read-write process 220 (2320 in FIG. 23). 
[0161] If original storage system 100A receives a command to read a journal from 

secondary storage system 100B, it reads the journal from the journal logical volume 230-4 
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(JNL1) using command reception process 210 and read-write process 220 and transmits the 
journal to secondary storage system 100B (2320). 

|0162] Secondary storage system 100B reads the journal from the journal logical 

volume 230-8 (JNL2) according to the update number using restore process 250 and read- 
write process 220 (both described above) and updates the data of the secondary logical 
volume 230-5 (COPY1), which is the duplicate of the original logical volume 230-1 
(DATA1 ) (2330). Consistency of data between the logical volumes can thus be maintained 
by updating the data in the order of update numbers. 

[0163] In the above-described data processing system in accordance with the present 

invention, the original storage system acquires a journal and stores it in a storage area 
specially designed for journals. Furthermore, the secondary storage system stores the journal 
received from the original storage system in a storage area specially designed for journals. 
The storage area specially designed for journals can be smaller than the storage area of the 
data replication object, and replication of data of the original storage system into the 
secondary storage system can be conducted with a smaller storage capacity. 
[0164] FIG. 24 shows a logical configuration of the third embodiment. 

[0165] In this configuration, host computer 180 and storage system 100C are 

connected by connection path 190; storage system 100C and original storage system 100A 
are connected by connection path 200, and original storage system 100 A and secondary 
storage system 100B are connected by connection path 200. Storage system 100C conducts 
data update of a logical volume 230-9 (ORG1) and data update of a logical volume (DATA1) 
in original storage system 100A during the data update into the logical volume (ORG1) of 
storage system 100C, as described with reference to the conventional technology. 
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[0166| Although original storage system 100A is indicated as having the original 

logical volume (DATA1) to storage system 100C, original storage system does not allocate 
actual storage areas, that is, storage devices 150. That is, no physical storage areas 
corresponding to the volume (DATA1 ) exists in the storage system 1 00A. For example, it 
sets numerical values indicating that storage devices 150 have not been allocated to the 
physical addresses of volume information 400. During the reception of the write command to 
the data of the original logical volume (DATA1 ) from storage system 100C, the original 
storage system 100A does not conduct process step 1270 of command reception process 210 
(described above) and conducts only the retention of journal in the journal logical volume 
230-4 (JNL1) (241 0 in FIG. 24). 

[0167] Secondary storage system 100B reads a journal from original storage system 

100A using journal read process 240 as described above , and retains the journal in the 
journal logical volume 230-8 (JNL2) using read-write process 220 (2420 in FIG. 24). 
[0168] If original storage system 100A receives a command to read a journal from 

secondary storage system 100B, it reads the journal from the journal logical volume 230-4 
(JNL1) using command reception process 210 and read-write process 220 and transmits the 
journal to secondary storage system 100B (2420). 

[0169] Secondary storage system 100B reads the journal from the journal logical 

volume 230-8 (JNL2) according to the update number using restore process 250 described 
above and read write process 220 and updates the data of the secondary logical volume 230- 
5_(COPYl), which is the duplicate of the logical volume (ORG1) (2430). The consistency of 
data between the logical volumes can thus be maintained by updating the data in the order of 
update numbers. 
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[0170] In the above-described data processing system in accordance with the present 

invention, when a failure occurs in storage system 100C or host computer 180 connected to 
storage system 100C, a journal (JNL1) present in original storage system 100A is reflected in 
the logical volume 230-5 (COPY1) of secondary storage system 100B, thereby making it 
possible to refer to and to update the newest data with the host computer connected to 
secondary storage system 100B. Furthermore, a storage capacity necessary for data 
replication can be decreased by storing only the journals, without holding data replication in 
original storage system 1 00A. 

[0171 ] In the above-described data processing system in accordance with the present 

invention, secondary storage system 100B reads (pulls) the journal from original storage 
system 1 00A. But, the present invention also works well when original storage system 1 00A 
transfers the journal to secondary storage system 100B at a timing given for the next journal 
command, as described above. 

[0172j While the invention created by the inventors has been described in terms of 

the preferred embodiments, those embodiments have been disclosed for illustrative purposes 
and those skilled in the art will appreciate that ttehthe invention is not limited to the above- 
described embodiments and various modifications, additions and substitutions are possible 
without departing from the scope and spirit of the invention as disclosed in the accompanying 
claims. 

In accordance with the present invention, a storage system can be provided that is capable of 
performing data transmission or data replication among a plurality of storage systems without 
affecting host computers in the storage systems. Further, a storage system can be provided 
that does not affect communication between the storage system and computers. 
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|0173l In accordance with the present invention, the data storage areas held in a 

plurality of storage systems can be reduced. Furthermore, data transfer or data replication 
between a plurality of storage systems can be conducted effectively and at a high speed 
producing no effect on the operation of the host computer of the plurality of storage systems. 
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